Deep Learning for Activity Recognition Using Audio and Video

نویسندگان

چکیده

Neural networks have established themselves as powerhouses in what concerns several types of detection, ranging from human activities to their emotions. Several analysis exist, and the most popular successful is video. However, there are other kinds analysis, which, despite not being used often, still promising. In this article, a comparison between audio video drawn an attempt classify violence detection real-time streams. This study, which followed CRISP-DM methodology, made use models available through PyTorch order test diverse set achieve robust results. The results obtained proved why has such prevalence, with classification handily outperforming its counterpart. Whilst attained on average 76% accuracy, secured scores 89%, showing significant difference performance. study concluded that applied methods quite promising detecting violence, using both

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

متن کامل

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Cartoon-recognition using video & audio descriptors

We present a new approach for classifying mpeg-2 video sequences as ‘cartoon’ or ‘non-cartoon’ by analyzing specific video and audio features of consecutive frames in real-time. This is part of the well-known video-genreclassification problem, where popular TV-broadcast genres like cartoon, commercial, music, news and sports are studied. Such applications have also been discussed in the context...

متن کامل

Deep Learning Architectures for Face Recognition in Video Surveillance

Face recognition (FR) systems for video surveillance (VS) applications attempt to accurately detect the presence of target individuals over a distributed network of cameras. In video-based FR systems, facial models of target individuals are designed a priori during enrollment using a limited number of reference still images or video data. These facial models are not typically representative of ...

متن کامل

Deep video gesture recognition using illumination invariants

In this paper we present architectures based on deep neural nets for gesture recognition in videos, which are invariant to local scaling. We amalgamate autoencoder and predictor architectures using an adaptive weighting scheme coping with a reduced size labeled dataset, while enriching our models from enormous unlabeled sets. We further improve robustness to lighting conditions by introducing a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2022

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics11050782